8 19 1

Daniel Korat

danielkorat

AI & ML interests

Inference acceleration, Low-resource NLP, Few-shot learning

Recent Activity

updated a dataset about 2 hours ago

danielkorat/dummy

published a dataset about 2 hours ago

danielkorat/dummy

upvoted an article 26 days ago

Introducing HELMET

View all activity

Organizations

danielkorat's activity

upvoted an article 26 days ago

Article

Introducing HELMET

and 6 others •

Apr 16

• 29

upvoted an article 2 months ago

Article

Speeding Up LLM Decoding with Advanced Universal Assisted Generation Techniques

and 8 others •

Mar 24

• 18

upvoted a paper 4 months ago

SQuARE: Sequential Question Answering Reasoning Engine for Enhanced Chain-of-Thought in Large Language Models

Paper • 2502.09390 • Published Feb 13 • 16

upvoted a paper 6 months ago

FastDraft: How to Train Your Draft

Paper • 2411.11055 • Published Nov 17, 2024 • 11

upvoted 3 articles 8 months ago

Article

Assisted Generation: a new direction toward low-latency text generation

•

May 11, 2023

• 64

Article

Blazing Fast SetFit Inference with 🤗 Optimum Intel on Xeon

and 5 others •

Apr 3, 2024

• 11

Article

Faster Assisted Generation with Dynamic Speculation

and 6 others •

Oct 8, 2024

• 47

upvoted an article 10 months ago

Article

SetFit: Efficient Few-Shot Learning Without Prompts

and 5 others •

Sep 26, 2022

• 28

upvoted a paper 10 months ago

RAG Foundry: A Framework for Enhancing LLMs for Retrieval Augmented Generation

Paper • 2408.02545 • Published Aug 5, 2024 • 38

upvoted an article 11 months ago

Article

Our Transformers Code Agent beats the GAIA benchmark!

and 1 other •

Jul 1, 2024

• 88

upvoted an article 12 months ago

Article

Training and Finetuning Embedding Models with Sentence Transformers v3

•

May 28, 2024

• 223

upvoted 2 papers about 1 year ago

Accelerating Speculative Decoding using Dynamic Speculation Length

Paper • 2405.04304 • Published May 7, 2024 • 2

Distributed Speculative Inference of Large Language Models

Paper • 2405.14105 • Published May 23, 2024 • 19

upvoted 2 articles about 1 year ago

Article

Building Cost-Efficient Enterprise RAG applications with Intel Gaudi 2 and Intel Xeon

and 8 others •

May 9, 2024

• 12

Article

Introducing the Open Leaderboard for Hebrew LLMs!

and 3 others •

May 5, 2024

• 45

upvoted a paper over 1 year ago

Improving Classification Performance With Human Feedback: Label a few, we label the rest

Paper • 2401.09555 • Published Jan 17, 2024 • 6

upvoted a paper almost 2 years ago

H_2O: Heavy-Hitter Oracle for Efficient Generative Inference of Large Language Models

Paper • 2306.14048 • Published Jun 24, 2023 • 12